Festvox: Tools for Creation and Analyses of Large Speech Corpora
Abstract
This paper summarises the tools provided within Festvox [1], a freely available software suite for creating and analysing large-scale speech corpora to enable research, development and instruction in speech technologies.
Similar resources
Diphone collection and synthesis
In this paper, we describe the design and collection of corpora for diphone synthesis, the voice building process, and our experience in the creation of a new, publicly available database of ten diphone sets of one American English speaker for the Festival Speech Synthesis System [3], using the FestVox document and tools [1]. In support of our goal to make the tools and techniques available f...
Development of a southern Swedish clustergen voice for speech synthesis
This paper describes the development of a speech synthesis voice with a southern Swedish accent. The voice is built for the Festival speech synthesis system using the tools in the festvox suite. The voice type is clustergen, which is a statistical-parametrical synthesis method where parametrical models for phonemes, duration and pitch are all built from a labeled speech d...
TBT (Toolkit to Build TTS): A High Performance Framework to Build Multiple Language HTS Voice
With the development of high quality TTS systems, the application area of synthetic speech is increasing rapidly. Beyond communication aids for the visually impaired and vocally handicapped, TTS voices are being used in various educational, telecommunication and multimedia applications. All around the world people are trying to build TTS voices for their regional languages. TTS voice building requi...
Querying Annotated Speech Corpora
This paper is concerned with querying annotated speech corpora. A growing number of such corpora is currently being created worldwide; however, their usefulness for a wider research community is restricted by the lack of standard tools for creating, editing, annotating, storing and querying them. Two solutions for these problems are presented here: the XML-based data format TASX for corpus crea...
IRCAM Corpus Tools: Managing speech corpora
Corpus based methods are increasingly used for speech technology applications and for the development of theoretical or computer models of spoken languages. These usages range from unit selection speech synthesis to statistical modeling of speech phenomena like prosody or expressivity. In all cases, these usages require a wide range of tools for corpus creation, labeling, symbolic and acoustic ...